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Abstract 

Time-continuous Chain and Multi-spacial Markov chains of time dependence of allele manifestation in cancer are newly written. The allele manifestation is 
associated with ’non-cancer states’ and ’cancer states’ which constitute nodes (and pathways). The allele manifestation is described as directed graphs (on the 
opportune manifold), which are modeled according to Markov Decision Processes. The measure from the algebra on the manifold of the directed graphs is newly 
proven to be matched with that of the Kantorovich metric on the Borel (sub-)sets of the Markov states space. The expectation of the reward is calculated in the 


case of ’non-cancer states’ and in that of ’cancer states’; the scheme is ready for a path-integral approach. 
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Introduction 

In [1], the transition probabilities of objects (i.e. ones from biological samples endowed with a measure from a o-algebra (within the framework of 
i.e. tissues, cells, sequences) from the ’non-cancer state’ to the ’cancer states’ semigroupoids) [8]; as anew consequence, the Kantorovich measure is newly 
are described for neighboring objects as far as both distance and state of the demonstrated to apply to the directed graphs which admit measure theory 
neighbors are concerned. from a o-algebra. The role of the propagated experimental errors is therefore 
The manifestation of the alleles is described in [2]. newly outlined. 

In [1], the presence of uncertainties is mentioned, from which the standard Furthermore, from [3], the QoS (Quality of Service) metrics are proposed in 
deviations can be newly calculated; nevertheless, no numerical values are order for the metrics to consist of the sum of probabilities. This method might 
reported, i.e. such that it is not possible, within the presented data, to assess apply to the comments in [5]. Within the present analysis, unaccordingly, the 
more specific Markov models. chosen measure is the Kantorovich measure. The expectation value of the 
The uncertainties on the identification of objects depending on the loci of the reward will be expressed also from the definition of the first hitting time. 
alleles within the Markov models are further discussed in [2]. The directed graphs allow one to use the path-integral approach in Markov 
The existence of experimental uncertainties due to the error propagation is models. The paper is organized as follows. 

recalled to be consistent with the Kantorovich measure from the Kantorovich In Section 3, the allele manifestation is defined as nodes within ’non-cancer 
metric of Markov Decision Processes. states’ and ’cancer states. The transition from the different states is recalled 
The possibility to define metrics and algebras is studied in [3]. The to define pathways. The pathways are framed within the Markov Decision 
calculation of the reward in [4] is here newly analytically proposed for ’non- Process. The pathways are identified with directed graphs. The Kantorovich 
cancer states’ and ’cancer states’. measure is reminded. The directed graphs are reviewed on the opportune 
The standard deviation of the experimental apparati and techniques are manifold endowed with a metric from a o-algebra from the groupoid 
recapitulated in [5] and in [6]; as a result, the ’similarity’ of states obtained technique. In Section 4, a new choice of the probability matrix of the Markov 
after the use of the Kantorovich metric is here understood to be consistent for chain of cancer progression is newly taken and explained. 

a broader class of phenomena; such phenomena were previously sampled as In Section 5, a new time-continuous chain inspired by the mover-stayer 
Monte Carlo [5]. Moreover, the distances for the first-order Markov process model is analytically written for the allele manifestation methods. 

extracted from the stayer-mover scheme are framed within the analysis of [5]. In Section 6, the allele manifestation phenomena are proposed as Markov 
Starting from the description in [17], the allele manifestation is in the present Decision 

work newly written as a landscape of oriented graphs with measure theory Processes. The allele manifestation is identified with paths that correspond to 
from a o-algebra. More in detail, the directed graphs [7] are recalled to be directed graphs; the Kantorovich metric is newly proven to the manifold of 
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the directed graphs as, in the latter case, a measure issued from a o-algebra manifestation and measure theory on directed graphs is prepared. The 
on Borel (sub-)set holds. expectation of the rewards is calculated in the *non-cancer Markov states’ 
In Section 7, a new mathematical application for the nodes as loci of the allele and in the ’cancer Markov states’. The results are discussed in Section 8. 


Introductory Material 

The Markov Models 

In [1], the definition of ’node points’ which have to be referred to as ’non-cancer states’ and ’cancer states’ is proposed. 

The existence of uncertain sources is pointed out. 

N is called the nodal points, with N = 1, N 

This way, the set N consists of Nij= {j € N |/ is the nearest neighbor of the node i}. Each one of the nodes can be either in the ’non-cancer state’ or in the ’cancer 
state’; the items of information are encoded within the ’switch’ parameters S;,i.e. such that S;= 0 corresponds to the description according to which the node 7 is 
in the non-cancer state, and S;= 1 corresponds to the fact that the node / is in the cancer state. 

There exists a ’memory-less’ exponential probability distribution with probability rates denominated Aj, which evolve in the time variable t as A(t). The 


probability distribution f(s,A;) depends on the time variables ¢ and s such as 


f(s, Ai) = Nie MOIGSD, 5 >. (1) 


The probability p to pass from the mode {S;= 0} to the mode {S;= 1} is therefore obtained as 
P(Sit +7) =1|S() =0) =~ 1-e NOT ~ Air + O(Ait)* (2) 


The probability matrix Pz over the time interval z is 


A 


Pr= (3) 
1—A(t)r 0 
| Att 1 
The probabilities of finding cancer states close to non-cancer states can be calculated by composition after Eq. (3). 


From [9], the irreversible cancer process is studied from the ’mover-stayer model’ described in [10]. 


The time-dependent probability matrix P(T) is decomposed as 
P (T) =8+(U-8)M(T) (4) 


In this study, a probability matrix is issued from the complete ’mover-stayer model’; indeed, the complete ’mover-stayer’ model does not correspond to a Markov 
process: nevertheless, a first-order Markov process is obtained in [9]. For these purposes, the Sojourn Time is defined as the lapse of time during which the tumor 
is staying in the ’pre-clinical’ state, but the phase is’ screen detectable’. 
Accordingly, the Mean Sojourn Time is the time during which there is the possibility of diagnosis. 
From [9] page. 267 ibidem, the ’malignancy grades’ are defined and here reported as 
1. No detectable disease. 
2. Pre-clinical screen-detectable grade 1-2. 
3. Pre-clinical screen-detectable grade 3. 
4. Clinical grade 1-2. 
5. Clinical grade 3. 
A Markov chain X(t) is described as a continuous-time T Markov chain, whose states are denominated Sj. 
The index i = 0,1,2 is explicated as 
i= 0 no detectable disease; 
i= 1 pre-clinical screen-detectable disease; 
i = 2 clinical disease. 


The index j is specified as 


j= 11-2 grade: 
j=23 grades. 
The Markov chain X(7) is defined on the state space Q(7) with 
Q(T) = {Soo, S11, Si2, S21, S22}. (5) 
The matrices in Eq. (4) are here spelled out. 
The matrix *s is a diagonal matrix, whose entries s;;, i= 1,2,.....,5are proportional to the ’stayer’ of each of the states as 
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S= (6) 


e 0 0 0 


Fie eo 
°o 
o 
°o 


The matrix M, is defined as 


M\= (7) 


The matrix M, is defined as 


M2= (8) 


Each columns of the two matrices M1 and M2 represent the possibilities {So0,511,.512,521,522} , respectively. 


From these objects, the first-order Markov process X” is issued. 


3.2 About the Bounding of the Probabilistic Metrics 
The determination of the convergence of a probability metric is studied in [11]. Let Q be a measurable space, with a o-algebra on the Borel subset B; furthermore, 


let X be the space of all the probability measures on (Q, B). Several notions of distances can be considered in X. 


Metrics in probability spaces are schematized in Table 1 ibidem. 


In Figure 1 ibidem, the relations between the metrics and the triples are studied accordingly; the diameter of the probability space diam(Q) is taken into account. 
As an example, the discrepancy metric can be considered. Given yw and v two probability measures, the distance dp is defined as 
dp(u, v) = SUP all closed spherical neighborhoods | UU (B) — v(B) | (9) 


The Diaconis measure [12] Eq. (9) is of use in the analysis of the experimental errors to be attributed to the committer found in [13] here to be applied within the 


framework of bisimulation metrics (which are illustrated in [4]). 


3.3 Measures for Directed Graphs 
In [8], measure theory for directed graphs is studied. From Lemma 1.1 ibidem, the basic diagrams of finite paths on G are defined 
The study makes use of the method of semigroupoids. The analysis is shown for the needed purpose. The cases of non-basic diagrams on G are 


to apply to o-algebras. reconducted to finite paths. 


The ai of [8] is the definition of a locally bounded positive measure for The use of the free-semi-groupoids allows one to consider the vertices on G. 


directed graphs; as an application, integration with this measure is performed. It is our care to anticipate that the vertices on G will be identified with the 


Given a graph G, the free semi-groupoid is considered. nodes from [1] 


4. New Developments of the Spatial Markov Chain Model of Cancer Progression 
From [1], the probability matrix of the Markov chain Pz (1) is here newly rewritten as a function of the time ¢ with c as a parameter from the new position 


Pr(t)= (10) 


| 


e- 


R(t) 


O 
1 


| 
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It is newly found that there corresponds to a Markov chain of fundamental matrix Or (t) which is here newly written as 


Or (t= 


(11) 


of 0) 
of Rit)) 0 


A(t) =~ o(0), (12a) 
o(R(t)) =O (12b) 


Therefore, it is newly found that 


5. Chain of the Mover-Stayer Model 


The complete mover-stayer model is not a Markov process: the corresponding chain is not assured to be a Markovian one. From the items of information used in 


[9], the fundamental matrix of the chain Om-—s (T) is newly spelled as 


dQin 3(T) . 
—.—__T ' C m-s = 
dT +Q 


The solution of Eq. (13) depends on the matrix M(T). 


dT 


d (P(r) - i) —dP(T) =) 


dT 


The one possible choice of the representation crucially depends on the numerical values of the entries of the matrices in Eq. (4). 


The Markov chain of the first-order Markov process can therefore be extracted accordingly. 


6.0 Allele Manifestation as a Finite Markov Decision Process 

The allele manifestation(s) of cancer states in objects is here newly described 
within the framework of a Markov Decision Process (MDP). 

From [4], the similarity of the states involved in a Markov Decision Process 
(MDP) allows one to introduce (experimental) uncertainties within the 
framework of multistate decision-making’ in the probability spaces. 
Therefore, the manifestation of the allele(s) and the choice of the loci are here 


newly schematized as a MDP. 


In [4], distance functions (and metrics) of the states of the MDP (the 
allele/loci Markov model) are defined. 

Accordingly, the metrics can be made use of in order to ’aggregate’ some 
(chosen) sets of states (alleles) which are here newly proposed to be used in 
several Markov models, i.e. such as (but not necessarily only) Hidden 


Markov State Models and subHidden Markov State Models. 


The time dependence of the allele manifestation of different loci in cancer 
can therefore be studied within the framework of the definition of the 
*similarity of states’ in a finite MDP; the ’similarity’ is understood within the 
numerical values of the standard deviations which characterize the choices of 
the representation of the matrices in Eq. (13); the experimental uncertainties 


are discussed throughout the present paper. 


6.1 Allele loci: States of the MDPs and Related Quantities 
An MDP is a finite set of states W and a finite set of actions A on the states., 
such that, for every pair of states w and w and the corresponding action an 
after which one can define 

e The probability matrix Py"; and 

e A numerical reward r,“ 
The details are to be reviewed from [14]. 


The calculation of the rewards allows one form [4] to find families of 


The Expectation Values of the Reward 


equations that provide one with the definition of the ’values of w. 


7.0 Novel Mathematical Description of the Nodes as Loci of the Allele 
Manifestation and Measure Theory on Directed Graphs 

The analysis of the pathways given in [17] according to which it is possible 
to trace the cancer states and the non-cancer states uniquely allows one to 
interpret such pathways in the state landscape as directed graphs. A 
comprehensive study of directed graphs is presented in [7]. It is here newly 
pointed out, from [8], that the measure of directed graphs is from the o- 
algebra on the subset. It is therefore here newly pointed out that the 
Kantorovich measure applies to directed graphs as follows. From Eq. (9), the 
probabilities of finding cancer states close to non-cancer states are here newly 
analyzed as forming directed graphs (on the opportune manifold). The 


findings of [6] can therefore be applied. 


Furthermore, the notion of ’bisimilarity’ can be introduced after [4]. The 
Kantorovich metric [15] can be applied to the probability functions. The aim 
of the use of the Kantorovich metric is the minimization of the reward. There 
exists, nevertheless, a notion of distances that do not define a metric, such as 


the Kullbach-Leibler distance [16]. 


Fixed-point metrics are used for chains that are not Markovian; the 
Kantorovich metric can be applied in this case. 

Function bounds can be evaluated. 

Error bounds of MDP follow. 

The nodes are here understood as the loci of the allele manifestations. 

The use is here newly proposed of the Kantorovich metric, after which the 
(number of) copies of the nodes allows one to select both the allele and the 
locus of its manifestation. The prospect is indicated for the first-order Markov 
process of the mover-stayer scheme. The notion of bisimilarity is indicated 


in the study of the committee found in [13]. 


The ’values of w from [4] are therefore the expectation of the reward. The algebra here newly defined allows one to specify the one expectation value of the 


reward for the time spent in each chosen subset (in this case, the chosen subset in the state w). 
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More in particular, the expectation value of the reward I is here specified after the Kantorovich measure. Given g an application 


G: w — [0, ~); (14) 
The reward I is calculated as 
+ 
— 15 
LT = Gw ( t) dt ‘ (15) 
O 


In the case the state w is beginning to be populated at the time zo, the time 70 is defined as the ’first hitting time, and the reward I’ is newly specified as 
T 
— (16) 
TO 


in this case, g is at least non-decreasing. 


As an easy example, the choice g = 1 can be taken for cancer states, while g = 0 can be assigned for non-cancer states. 


Outlook Moreover, if the choice to implement time discretization is taken, the 
In the present paper, the allele manifestation in cancer is newly studied as a study of clinical screens within the corresponding available data is newly 
time-continuous chain whose states are in an opportune Borel (sub-)set; the favored, i.e. after the long-standing problem proposed. 


new chain is explicitly analytically written. The allele manifestation is newly 
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